GLOBAL REGULARITY OF WAVE MAPS I. SMALL CRITICAL 
SOBOLEV NORM IN HIGH DIMENSION 
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Abstract. We show that wave maps from Minkowski space R 1+n to a sphere 
S m ~ 1 are globally smooth if the initial data is smooth and has small norm 
in the critical Sobolev space H n / 2 , in the high-dimensional case n > 5. A 
major difficulty, not present in earlier results in this area, is that the if™/ 2 
norm barely fails to control L°° , potentially causing a logarithmic divergence 
in the non-linearity; however this can be overcome by using co-ordinate frames 
adapted to the wave map by approximate parallel transport. In the sequel [ pq 
of this paper we address the more interesting two-dimensional case n = 2, 
which is energy-critical. 



1. Introduction 



Throughout this paper m > 2, n > 1 will be fixed integers, and all constants may 
depend on m and n. 

Let R 1+ " be n+1 dimensional Minkowski space with flat metric r\ :— diag(— 1, 1, . . . , 1), 
and let S™ -1 C R m denote the unit sphere in the Euclidean space R™\ Elements 
4> of R m will be viewed as column vectors, while their adjoints (ft are row vectors. 
We let d a and d a for a = 0, . . . , n be the usual derivatives with respect to the 
Minkowski metric r\. We let □ := d a d a — A — df denote the D'Lambcrtian. We 
shall also use 4> for dt<j>. 

Define a wave map to be any function (j> defined on an open set in R 1+ ™ taking 
values on the sphere S™ -1 which obeys the equation 

d a d a (j> = -4>d a (ftd a 4> (1) 

in the sense of distributions. In order to make sense of ([!]) we shall require <j> to be 
in C\L\ n C t °£Tj; in our high-dimensional context this regularity shall be easy to 
obtain. 

We shall define a classical wave map to be a wave map which is smooth and equal 
to a constant outside of a finite union of light cones. 

For any time t, we use <p[t] :— ((j>(t) , <j>(t)) to denote the position and velocity of (f> 
at time t. We refer to </>[0] as the initial data of (j). We shall always assume that 
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the initial data 4>[0] — (0(0), 0(0)) satisfies the consistency conditions 

0t( O )0(O) = l; 0t (Q )0(o) =O (2) 

(i.e. (j>[0) lies on the sphere). It is easy to show (e.g. by Gronwall's inequality) that 
this consistency condition is maintained in time, for smooth solutions at least. 

Let H s := (1 + y/^Ay s L 2 (R n ) denote the usuaf] L 2 Sobolev spaces. Since the 
equation (Q) is invariant under the scaling tp i— ► cf>\ defined by 

4>\{t,x) :=<p(t/X,x/X) 

we see that the critical regularity is s = n/2. 

The Cauchy problem for wave maps has been extensively studied (see references); 
we refer the interested reader to the surveys in ]T^|, Jl6[ , |23| , (26) . For sub-critical 
regularities s > n/2 it is known (see [OL |i~5|], ]10| for the n > 4, n — 2,3, and 
n = 1 cases respectively) that the Cauchy problem for ([!]) is locally well-posed for 
initial data <f)[0] in H s x H s ~ l , and the solution can be continued (without losing 
regularity) as long as the H B norm remains bounded. The critical result however is 
more subtle. Well-posedness and regularity was demonstrated in the critical Besov 
space B^ 2 in [^9| in the high-dimensional case n > 4 and in J3(J for n = 2, 3; 
in the one-dimensional case n = 1, a logarithmic cascade from high frequencies to 
low frequencies causes ill-posedness in the critical Besov and Sobolev spaces p7[ , 
although global smooth solutions can still be constructed thanks to the sub-critical 
nature of the energy norm (see e.g. 0])- As is to be expected at the critical 
regularity, these results give a global well-posedness (and regularity and scattering) 
when the norm of the initial data is small. 

The question still remains as to whether the wave map equation (Q) is well-posed 
in the critical Sobolev space H n / 2 := \J — A L 2 in two and higher dimensions, 
with global well-posedness and regularity expected for small data. This question is 
especially interesting in the two dimensional case, since the critical Sobolev space 
is then the energy norm H 1 , and one also expects to exploit conservation of energy 
(and some sort of energy non-concentration result) to obtain global well-posedness 
and regularity for data with large energy. (In dimensions three and higher one does 
not have large data global well-posedness for the sphere, even for smooth symmetric 
data; see p4|). However the Sobolev space H n l 2 is significantly less tractable than 
its Besov counterpart B^ 2 \ for instance, H n / 2 norm fails to control the L°° norm 
due to a logarithmic pile-up of frequencies. This logarithmic divergence is respon- 
sible for failure of any strengthened version of well-posedness (uniform, Lipschitz, 
or analytic) for wave maps at this regularity, as well as ill-posedness in very similar 
equations, and is a serious obstacle to any iteration-based argument. See |15| , ||l0[ , 
(l8| for further discussion. 

Our main result is the following. 



1 Strictly speaking, one cannot use H s spaces for functions on the sphere, since they cannot 
globally be in L 2 . To get around this we shall abuse notation and allow constant functions to lie 
in H s with zero norm whenever the context is for functions on the sphere. Thus when we say that 
4>(t) is in H a , we really mean that <j>(t) — c is in H s for some constant c. 
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Theorem 1.1. Let n > 5 and s > n/2, and suppose that the initial data <p[0] is in 
H s x H s ~ x and has sufficiently small if™/ 2 x if™/ 2-1 norm. Then the solution to 
the Cauchy problem for ([!]) with initial data <f>[0] can be continued in H s x H"^ 1 
globally in time. In particular, smooth solutions stay smooth when the initial data 
has small H n ' 2 X fl""/ 2 " 1 norm. Furthermore, if\s — n/2\ < 1/2, we have the global 
bounds^ 



Our argument also shows that <j> obeys the expected range of Strichartz estimates 
globally in spacetime, although we will not write down a precise statement here. 

Our arguments are heavily based on the geometric structure of the equation (|l|) ; in 
particular, they do not directly apply to the associated difference equation. As a 
consequence we have not been able to obtain a well-posedness^] result at the critical 
reg ularity iW 2 x if"/ 2 " 1 , even for small data. The argument also does not directly 
yield a scattering result, although this obstruction seems to be less serious. We will 
not pursue these matters. 

Our main tools are Littlewood-Paley decomposition and Strichartz estimates (as 
in pifl), combined with some geometric identities (such as <ftd a (f) = 0) and the 
use of a good co-ordinate frame constructed by approximate parallel transport; 
this renormalization is crucial in order to remove the possibility of logarithmic 
divergence. The high-dimensional case n > 5 is significantly easier than the low- 
dimensional cases, because of the strong decay of the wave equation (i -2 or better) 
as well as the rarity of parallel interactions (one expects interactions of angle 0(6) to 
only occur 0(8 n ^ 1 ) = 0(9 4 ) of the time). Because of these advantages, we shall not 
need to develop bilinear estimates to take advantage of the null form structure of ([!]) 
(cf. jnj), although we shall heavily exploit the geometric structure of this equation. 
Instead of bilinear estimates, we shall rely primarily on Strichartz estimates, and 
in particular on the L 2 i^°-type and L 2 L^-type estimates which are not available 
in low dimensions. With such strong estimates it shall be easy (after applying the 
renormalization) to obtain L\ L 2 -type estimates on the non-linearity, so that one 
can then close the argument by energy estimates. (Without the normalization, 
Strichartz estimates only work when one has half a derivative more than critical) . 

In the n — 4 case one loses the L 2 L 4 estimate, however one can introduce X s ' b 
spaces to obtain L 2 L 2 estimates on □(/>. This, together with the identity 

2d a <t>d a i) = □ (#) - <f>n(i/)) - n(4>)4, 

to exploit the null structure, should be able to cover the n — 4 case in analogy with 
the arguments in [ p9[ , however we have elected not to do this to keep the argument 
as simple as possible. 

2 We of course adopt the convention that A < B denotes the inequality A < CB for some 
constant C depending only on n, m. 

3 We should remark at this point that strong versions of well-posedness, such as uniform, 
Lipschitz. or analytic well-posedness, are known to fail at the critical Sobolev regularity (see Jlo| , 
Pjl). Presumably one would have to renormalize the difference equation in a manner strongly 
dependent on the initial data. 
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In the n = 2, 3 cases one also loses the L^L"^ estimate, which seems to defeat any 
attempt to prove these results purely by standard Strichartz estimates (although the 
n = 3 case is probably salvageable for radial data, thanks to the endpoint Strichartz 
estimate holding in that context). Fortunately, the low dimensional case has been 
effectively handled (in the Besov space case) by the more sophisticated arguments 
of J30| , using the additional ingredients of angular frequency decomposition and 
X s ^-ty-pe spaces (to more effectively exploit the null form structure in ([I])) and 
null frames (to recover estimates of L^L^ type). In the sequel pi to this paper 
we shall adapt the arguments in to cover the n = 2 small energy case, as well 
as the remaining cases n = 3, 4. 

The main novel ingredient in our approach is the use of adapted co-ordinate frames 
constructed by approximate parallel transport along (Littlewood-Paley regulariza- 
tions of) 4>. The construction presented here is heavily dependent of the geometry 
of the sphere, although this should in principle extend to other compact manifolds 
by using the machinery of Hclcin in his work j^] on harmonic maps^j. 

Without the use of these frames, the usual iteration approach for ([!]) fails at the crit- 
ical regularity because of a logarithmic pile-up of high-low frequency interactions. 
The effect of the adapted co-ordinate frame is to transform the high-low frequency 
interaction into other terms which are more tractable, such as high-high frequency 
interactions, or high-low interactions in which a derivative has been moved from a 
high-frequency term to a low-frequency one. 

In the remainder of this section we shall informally motivate the key ideas in the 
argument. In doing so we shall make frequent use of the following heuristic: if 
<p, tp are two functions, and ip is much rougher (i.e. higher frequency) than </>, 
then (V</>)V> is very small compared to cpVip- In other words, we should be able to 
neglect terms in which derivatives fail to fall on rough functions, and land instead 
on smooth ones. (Indeed, these terms can usually be treated just by Strichartz 
estimates). In particular, we expect to have V(0i/>) ~ 0VV> (which can be viewed 
as a statement that <j) is approximately constant when compared against tp). 

Let us suppose that our wave map <f> has the form (p = <p + etjj , where <p is a smooth 
wave map, < e <C 1 and -0 is a H n ^ 2 function which is much rougher than <f>. (In 
other words, <f> is a small rough perturbation of a smooth wave map). If we ignore 
terms which are quadratic or better in e, or which fail to differentiate the rough 
function tp, we obtain the linearized equation 



for tp. Also, since <j> and <p + sip both take values on the sphere we see that 



(again ignoring terms quadratic in e, and terms where the derivative fails to land 
on tp). 




(4) 



fill) = 0; ftd a il> = 



(5) 



4 Since the preparation of this manuscript, the author has learnt (Klainerman, personal com- 
munication) that the arguments here have been successfully extended to arbitrary Lie groups by 
Klainerman and Rodnianski. 
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In order to keep the H s norm of (f>+eip from blowing up, we need to prevent the H n / 2 
norm from being transferred from (f> to eip. In particular, we need Uj^H^ 2 bounds 
on ip which are independent of e. We would also like the corresponding Strichartz 
estimates for ip, in order to control the error terms that we have been ignoring. 
(This scheme is not restricted to rough perturbations of smooth wave maps, and 
will be adapted to general wave maps by use of Littlewood-Paley projections). 

Despite being linear, the equation (^) is not very well-behaved, having no obvious 
cancellation structure (beyond the null form, which is not particularly useful in 
the high-dimensional setting) . In order to iterate away the first-order terms on the 
right-hand side of (||) we would like (j)d a (j^ to be in L\L^ . In principle this might 
be feasible if we had the Strichartz estimate V 1 / 2 ^ S L 2 L^ : but this estimate 
just barely fails to hold because of a logarithmic divergence in the frequencies. 
However, if we could somehow ensure that the derivative in 4>d a (f>' < always fell on 
a low-frequency component of <fi and not on a high-frequency component then one 
would have a chance of iterating away the non- linearity^]. This will be accomplished 
by a renormalization using a co-ordinate frame adapted to (f>. 

We begin by taking advantage of (||) to rewrite (^) in a form reminiscent of parallel 
transport: 

d a d a 4> = 2A a d a ip (6) 

where A a is the matrix 

A a := d a (f>ft - (^daft. 

Note that @ exhibits more cancellation than (|J), as A a is now anti-symmetric. 
This type of trick is standard in the study of wave and harmonic maps, see e.g. 
Hi), g, §, etc. 



To solve (||), let us first consider the ODE analogue 

4> = 2A V>. (7) 

The matrix Aq is anti-symmetric. Thus if we let U(t) be the matrix-valued function 
solving the ODE 

U(t) = A U(t) 

with U(0) initialized to the identity matrix (say), then we see that -^(UU^) = 
and thus that U remains orthogonal for all time. Indeed, one can view U as the 
parallel transport of the identity matrix along the trajectory of <fi. Furthermore, 
since <p is smooth, we see that U is also smooth, and in particular is much smoother 
than tp. One can then use the linear change of variables "0 = Uw, and ignore terms 
which fail to differentiate the rough function w, to rewrite (^) as the trivial equation 
w = 0. 

The ODE example of (0) suggests that (|^) might be simplified by applying some 
orthogonal matrix U to the wave ip, or in other words by viewing ip in a carefully 
chosen co-ordinate frame. (This fits well with the corresponding experience of 



5 The author thanks Chris Sogge for this observation. 
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harmonic maps in [g). Ideally, we would like U to be carried by parallel transport 
by <j) in all directions. More precisely, we would like U to solve the PDE 

d a U — A a U (8) 

for each a. If we make the improbable assumption that U obeyed (@) exactly for 
all a, we can then substitute ip = Uw as before and ignore all terms which fail to 
differentiate the rough function w to transform @ to the free wave equation 

d Q d a w = 

which we of course know how to solve. 

Unfortunately, the system Q of PDE is overdetermined, and in general has no 
solution (since the parallel transport connection induced by </> will have a small^ 
but non-zero curvature). Nevertheless, it is possible to use Littlewood-Paley theory 
to construct a satisfactory approximate solution U to (Q). Specifically, we perform 
the Littlewood-Paley decomposition (f> — 4>-m + M<fe^ )fe ' wnere M is a large 
number, 0_a/ is the portion of (f> on frequencies |£| < 2~ M , and <pk is the portion 
on frequencies |£| ~ 2 k . We then define U = U-m + Yl-M<k Uk, wncrc U-m is the 
identity matrix, and the Uk are defined recursively by the formula 

U k ■= (&4 fc - <f><k4)U<k (9) 
where 4><k, f/<fc are the functions 

4><k ■= 4>-M + <i>k, U <k :=U- M + U k - 

-M<k'<k —M<k'<k 

It then transpires that the matrix U is approximately orthogonal and approximately 
satisfies (||), provided that the H n / 2 norm of U is sufficiently small and M is 
sufficiently large. The point is that (\>k is a rougher function than 4><ki an< i so one 
can (heuristically) neglect terms where the derivative falls on (j) < k instead of 4>k- 
Similarly for Uk and U <k - Thus we can morally differentiate (||) to obtain 

d a U k » (0^*^ ~ 4><kd a 4>l)U <k (10) 

and (JsJ) follows by summing the telescoping series (and continuing to neglect the 
same type of terms as before). The approximate orthogonality of U is based on 
the observation (from @) that UlU<k + U* <k Uk = 0. Summing this in k and 
telescoping, we obtain 

rfu = i+ ]T u\u k . 

k>-M 

The summation on the right-hand side then turns out to be negligible if we assume 
<j> is small in H n / 2 , since this implies from Sobolev embedding that the L°° norms 
of the 4>k (and hence the Uk) are small in I 2 . (A similar argument can be used 
to dispose of the error terms which were neg lected in @). If one then trans- 
forms @ using ip = Uw as before, we obtain a non-linear wave equation for w, 

6 More precisely, the curvature only contains terms which are quadratic in the first derivatives 
of <j>, as opposed to being linear in the second derivatives of tf>. This phenomenon seems specific to 
the wave maps equation; if one tries to apply the techniques here to (for instance) the Maxwell- 
Klein-Gordon or Yang-Mills equations at the critical Sobolev regularity, an obstruction arises 
because the connection A has no reason to have a good curvature, regardless of the choice of 
gauge. At best one can place these equations in the Coulomb gauge, which was already known to 
be the most useful gauge to study these equations. 
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but all the terms in the non-linearity either contain expressions such as UkUl 
which are quadratic^ in the frequency parameter k, or have all derivatives falling 
on smooth functions rather than rough ones. Both types of terms turn out to be 
easily controlled by Strichartz estimates. 

This work was conducted at UCLA, Tohoku University, UNSW, and the French 
Alps. The author thanks Daniel Tataru, Mark Keel, and Sergiu Klainerman for 
very helpful discussions, insights, and encouragement, and to Sergiu Klainerman, 
Kenji Nakanishi, Igor Rodnianski, and the referee for pointing out errors in an early 
preprint. The author is a Clay Prize Fellow and is supported by grants from the 
Sloan and Packard foundations. 



2. Littlewood-Paley projections and Strichartz estimates 

In this section we set out notation for two basic tools in this argument. 

We begin with Littlewood-Paley operators. If <f>(t, x) is a function in spacetime, we 
define the spatial Fourier transform 4>(t, f ) by 

kt,Cj := [ e- 2 ™ x <cf>(t 7 x) dx. 

Fix m(£) to be a non-negative radial bump function supported on |£| < 2 which 
equals 1 on the ball |£| < 1. For each integer k, we define the Littlewood-Paley 
projection operators P<k = P<k+i to the frequency ball |£| < 2 k by the formula 

and the projection operators Pk to the frequency annulus |£| ~ 2 k by the formula 

Pk '■= P<k — P<k- 

We also define more general projections Pk x <-<k 2 by 

Pk t < <k 2 '■= P<k 2 - P<k t - 
Similarly define Pk 1 <-<k 2 > e ^ c - 

Note that if is a smooth function which is equal to a constant e outside of a 
compact set, then we have the Littlewood-Paley decomposition 



<p = e ■ 

k 

Also, we remark that the Littlewood-Paley projections defined above commute with 
all constant-coefficient differential operators and are bounded on every Lebesgue 
space (including mixed- norm spacetime Lebesgue spaces). 

Because we are in the high-dimensional case n > 5, we will not need X s ' b -type 
spaces. Indeed, these spaces do not quite seem to be the right tool for dealing 



7 Basically, such quadratic expressions effectively improve the Sobolev space H n / 2 to the Besov 
space -B"^ 2 , which in principle can be treated by the arguments in | pg| . 
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with critical Sobolev regularity problems (despite being very powerful for subcrit- 
ical problems). Because we are avoiding these spaces, we may localize in time 
freely without encountering distracting technicalities involving the temporal Fourier 
transform. 

We now describe the Strichartz estimates that we need. Let us call a pair (q, r) of 
exponents admissible if 2 < q, r < oo and 

1 (n-l)/2 (n-l)/2 
g r - 2 

For any integer fc, we define the (H n > 12 -normalized) Strichartz space at frequency 
2 k , S f / t(R 1+n ), to be the space of functions on spacetime whose norm is given byQ 

U\\ Sh := S up2« + ^(||0|| i?L , +2- k \\d t ^\\ L , L r), 

q,r 

where the supremum ranges over all admissible exponents (g, r). Similarly define 
Sk(I x R") for time intervals I. Generally speaking, the large values of r are 
good for low-frequency terms, and conversely for high-frequency terms. In our high 
dimensional setting n > 5 we have a very large set of Strichartz estimates which 
will be more than adequate for our purposes. 

We shall only use specific values of q and r in our argument. More precisely, we 
observe that control of the Sk norm gives the estimates 



2r 2(n-l)/( 



LiL 



re Tire i Tire ,,, , 

,_ 3) <2~ "-+— \\0\\ Sk (12) 



IMUj£i<2-"|Mk (13) 

\\<p\y t L^< 2 ~^^u\\s h (14) 

IMU ?i ~ < 2"%|k; (15) 



4r,2( 



^-D <2-*-*G=V\\<f>\\ Sh (16) 



ll^lUr^ < ll^lls, (i7) 

\\4>\\l?l%< 2"* ||0||s, (18) 

note that we have used the hypothesis n > 5 in order to obtain the admissibility 
of (13) and (14). Clearly, one can also estimate the time derivative 4>t m the above 
norms by paying an additional power of 2 k . 

Suppose 4> is a smooth function and if) is a rough function. Our non-linearity is 
cubic in <f> and ip with two derivatives somewhere. To estimate a term such as 
0V0V*0 in L^L 2 ,, we shall usually estimate using (18) and <fi, using (15); 
this turns out to work as long as the term has equal or higher frequency to V0. 
The estimate (13) is useful for obtaining L\l? x control on terms such as S7ipVip(f> 
which are quadratic in the high frequencies, and the pair (14), (12) are useful for 
controlling terms such as 0V 2 0^>. Finally, the triplet (16), (16), (12) can handle the 
term V0V0'0. There is a certain amount of flexibility in our choice of exponents, 



8 The powers of 2 fe which will appear in the sequel are not mysterious, and can be explained 
by scaling. One should think of 2 k has having the units of frequency (i.e. inverse length), so that 
Sk has the scaling of H™^ 2 or Lj^LJ . 
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especially in large dimensions; in particular, the endpoint (12) can be avoided when 
n > 6. 

We have the Strichartz estimates (see e.g. || and the references therein): 

Theorem 2.1 (Strichartz estimates). Let k be an integer. For any function <f> on 
R + " with Fourier support on |£| ~ 2 k , we have 

Uh k < \\m\\H^m /2 - 1 + 2 ~ k+!t ^H^- 

Similarly if R 1+ " is replaced by I x R" for any interval I containing the origin. 

One could place U<f> in other spaces than L\L? X (indeed, one could use the dual of 
Sk, appropriately normalized) but we shall not need to do so here. 



3. The main proposition 



In this section we state the main proposition which will be used to prove Theorem 
|l.l| . Roughly speaking, the proposition asserts that if the H n / 2 norm of a (classical) 
wave map is initially small, then it stays small for all time, and its frequency 
profile does not change substantially. From this proposition and th e ex isting well- 



posedness theory it shall be an easy matter to obtain Theorem LI by general 
arguments. 



Throughout the paper we fix a to be a constant depending only on n such that 
< a < 1/2 (e.g. a := 1/4 will do). We also fix < e <C 1 to be a small constant 
depending only on n, m, a (e := 2~ 100nm will suffice). 

Definition 3.1. A frequency envelope is a sequence c = {ck} k£ z of positive reals 
such that we have the I 2 bound 

\\o\\p<s (19) 

and the local constancy condition 

2-^ k ~ k ^c k ' < c k < 2 CT|fe - fc V (20) 

for all k, k' € Z. In particular we have Ck ~ c' k whenever k = k' + 0(1). If c is a 
frequency envelope and {f,g) is a pair of functions on R™, we say that (/, g) lies 
underneath the envelope c if one has 

|| + 11^.911^/2-! <C k 

for all k e Z. 



Note that if (/, g) lies underneath an envelope c, then 

\\{f,9)\\H^H^<S- (21) 

Conversely, if ( |2l| ) holds (with slightly better implicit constants) then there exists 
an envelope c which / lies underneath, for instance one can take 

c k := Y, ^ k ~ k 'K\\Pk>f\\H^ + 11^.911^/2-0- (22) 
fe'eZ 
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Proposition 3.2 (Main Proposition). Let < T < oo, c be a frequency envelope, 
and 4> be a classical wave map on [0, T] x R" such that <fi[0] lies underneath the 
envelope c. Then, if e is sufficiently small, we have the bounds 

IWIIs fc ([0,T]xR") ^ C ° Ck ( 23 ) 

for all k £ Z, where Cq 3> 1 is an absolute constant depending only on n, m 



(i.e. independent of T , <f>, c, e). From (]23|), (18), and Definition 3.1 we have in 
particular that <f>[t] lies underneath the envelope CqC for all t £ [0, T\. 



We remark that the finiteness of T is needed for some continuity and limiting 
arguments to work, but otherwise T plays no role in the estimates. At first glance 
this Proposition seems to be merely a variant of a conservation law for the Ll n l 2 
norm, but the fact that the envelope c is arbitrary makes this result far more 
powerful; indeed, it effectively allows one to deduce the H n / 2 regularity theory 
from the H n / 2+ theory. Also, we remark that if we strengthen the I 2 control in 



91) to I 1 control, then Proposition 3.2 was essentially proven in p 



Assuming this Proposition, the proof of Theore m l l.ll is now easy. In light of the 
existing regularity and well-posedness results^] in |lq| (see also |2j| ) , as well as finite 
speed of propagation, it suffices to prove that the theorem when s is close to n/2, or 
specifically when n/2 < s < n/2 + a, and we may also assume by standard limiting 
arguments that <f> is a classical wave map. But if the initial data <f>[0] is in H s , 
then there exists an envelope c that <f>[0] lies under which decays like 2~ tyS ~ n / 2 ^ k as 



k — > oo (just by applying (|22|)). By Proposition 3.2, <fi[t] lies under the envelope 
Ccq for as long as the solution stays smooth, which in turn implies that the H s 
norm of cf>[t] stays uniformly bounded for as long as the solution stays smooth. The 
claim then follows from the regularity results in fl5|] , |29^] . The bound (||) follows 
easily as a by-product of the above argument (note that this bound also applies for 
s slightly below n/2). 



We close this section by performing some preliminary reductions for Proposition 



3.2. Fix (j>; we may assume that (f)(0) = e\ and dt<fr(0) = outside of a compact 
set. Since <f> is a classical wave map, the H n / 2 x H 71 / 2 ^ 1 norm of Pk4> decays faster 
than 2~l fe l as k — > ±oo. We may thus assume that c decays like 2 -CT l fc l as k — * oo. 



Fix c. We shall apply the continuity method. Let AC [0,T] be the set of all times 
T' C [0,T] such that @ holds (with T replaced by T 1 ). Observe that A is closed. 
Let A' C [0, 1] be defined similarly but with ( p3| ) replaced by the variant 

H P ^lls fc ([o,T]xR") < 2 ^oc fc (24) 

and with T replaced again by T' . Clearly A C A'. In fact, since <fi is a classical wave 
map we see that ||Pfc<^||s fc decays like 0(2~l fc l) as k — > ±oo, and we can therefore 
make the stronger statement that A is contained in the interior of A'. Also, A 

9 Strictly speaking, this regularity result was stated for data with small H s X H s ~ 1 norm, but 
the result extends as well to large data; see jlfj], [ juj . Alternatively, one can exploit finite speed 
of propagation and the fact (which is a variant of Poincare's inequality) that if has large H s 
norm, then the H s norm of on a small ball B(x, r) decays like a power of r for n/2 < s' < s, 
recalling that we allow a constant to be subtracted from <j> when defining the H a norm. 
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trivially contains 0. Thus if we can prove that A' C A, then A would be open, 
closed, and non-empty in [0, T], so that T S A. This will prove Proposition |3.2[ 

From the above discussion, we have reduced Proposition |3.2| (and Theorem [0]) to 

Proposition 3.3 (Reduced Main Proposition). Let c be a frequency envelope, < 
T < oo ; and let <f> be a classical wave map on [0, T] x R" such that <f>[0] lies 
underneath c, and that ( ^4j) holds for all k. Then ( ^3|) holds for all k (assuming 
that Cq is sufficiently large and e sufficiently small). 



The proof of Proposition 3.3 shall occupy the next few sections. In Section |4| 
we apply Littlewood-Paley projections to reduce (Q) to the linearized variant (|J)7 
modulo acceptable errors. In Section^ we develop rigorous versions of (^]), and thus 
convert (Q) to the anti-symmetric form (^). In Section ^ we use the approximate 
parallel transport scheme ([)]) to construct a good co-ordinate frame U, and in 
Section [?] we show how the transformation tp — Uw transforms (JsJ) into the free 
wave equation, modulo acceptable errors. 



4. Linearization 



Fix c, T, <f>, and suppose that the hypotheses of Proposition 3.2 hold. Henceforth 



all spacetime norms will be assumed to be on the slab [0, T] x R". 

Since the differential operator 2 _fc V is bounded on frequencies |£| ~ 2 k , we thus 
have that 

HV^Ik < V k C c k (25) 
for all k G Z and all j, with the implicit constant depending on j. 

We need to show (p3|). By scale-invariance (scaling T, c, and <fi appropriately) it 
suffices to show that 

U\\s„ < Coco, (26) 

where we define 

tp := P^- 

By applying Pq to (H) we obtain 

□V - -P (</>d a tfd a <f>). (27) 

We would like to convert this non- linear equation into the linearized equation (|J), 
modulo acceptable errors. To define "acceptable error" we introduce 

Definition 4.1. A function F on [0, T] x R" is said to be an acceptable error if 

\\F\\lIli < Cisco, 
and we shall write F — error to denote this. 
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Ideally we would like the entire non-linearity in (|27|) to be an acceptable error, as 



one could then use Theorem 2.1 to obtain fl26|). Although we cannot quite do this 
directly, we can show that almost all of the non-linearity is an acceptable error, 
and the remaining term can be renormalized by a suitable change of co-ordinates 
to also be acceptable error. 

We shall use <j> := P<_io0 to denote the regularization of <f>. Since <\> lies on the 
sphere, we thus have 

IMk-z- % M\\l T l~ = 1. (28) 
Also, from (15), (|2q), and the triangle inequality we observe the useful bound 

l|V0|| L?LS o <C e. (29) 

The main result of this section is then 
Proposition 4.2. We have 

P {(j)d a ^d a <p) = 24>d a ftd a 4> + error. (30) 



In particular, from (27) we have 

□V> = -2^>d a ^d a ip + error. (31) 

Proof We apply the Littlewood-Paley decomposition to split the expression inside 
the projection Pq in (^) as 

(P kl 0)(d a P k2 ^)(d a P k3 0) (32) 

^1,^2,^3 :max(/c2,^3)> 10, 1^2 — ^3 | <5 

+ £ (P kl 0)(d a P k2 ^)(d a P k3 0) (33) 

ki ,fc2^3 :max (^2^3)> 10, | &2 — &3 | >5 

+ £ (P kl <f>)(d a P k2 tf)(d a P k3 <f>) (34) 

+ 4>d a ftd a <t> (35) 

+ 0(a Q P_ lo <.<io0 t )(a Q P-io<.<io0)- (36) 

+ <M^ t (d Q P_ lo <.<io0) ( 37 ) 

f)d a (/). (38) 

As it turns out, all the terms except for (37), (38) will be of the form error. 

We first consider the high-frequency contributions (32), (33). We can rewrite (32) 



a.s 



£ {d a p k2 4> ] ){d a p k3 4>). 

k2,k 3 :msx{k2 ,k 3 )>W ,\k 2 — fc 3 | < 5 



To show that the contribution of this term is error, it thus suffices by ( 28 ) and the 
triangle inequality to show that 

£ \\{d a P k ^){d a P k M\LlLl < Cfce . 

k2,k 3 :max(k2,k 3 )>10,\k2 — fc 3 | <5 
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We use Holder to split L\L\ into two L\L% norms and apply (13), (|25|) to estimate 
the left-hand side by 

C c kl 2 k - 2- ~ " V C c k2 2* 3 2" * ~ 

k2,k 3 :max(k2,k 3 )>10,\k2— fe 3 |<5 

which by ( p(i| ) is bounded by 

<C 2 C 2 ^ 2 2M-+W-f). 
fc 2 >10 

But this is acceptable from our choice of a and the fact that cq < e. 

Now we consider (33). By symmetry it suffices to consider the contribution when 
&2 > &3 + 5, 10. In this case we may assume that \ki — &2| < 5 since the contribution 
to ([5(j|) vanishes otherwise. By the triangle inequality it thus suffices to show that 

ll(ft 2 -5<.< fe2+ 5^)(9 ce P fc2 t )(a Q P< fe2 -50)||L;^ <C 3 Q ec . 

k 2 >10 

We use Holder, splitting L\L\, L^L*, LfL^, and use (13), (17), ( psj ) and decom- 
position into projections P& to estimate the left-hand side by 

]T (C c fe 2-*-T)(C c fe2 2 fc2 2-*-T)(2fe) 

fc 2 >10 

which is acceptable by the same calculation used to treat (32). 



Now consider (34). We may assume that k\ < 15 since the contribution to (30) 
vanishes otherwise. We can thus simplify (34) as 

{P- 10<<15 
By Holder it thus suffices to show that 

||-P-io<.<i50||L-L2||a Q P<i O '?!)||L2 i? ,||9 Q P <1O ( ? !)|| i2i? , < C^ec . 
But this is immediate from (18), (p5f), and (a trivial modification of) (p^). 



The contribution of (35) to (30) is always zero, so we turn to (36). In light of (28) 
and Holder it suffices to show that 

||<9 q P_io<.<io0|Il2 £ 4 < CqECq. 

But this is immediate from (13), (p5[), and a breakdown into projections P&. 

The terms (37) and (38) are equal. We have thus shown that 

Po(4d a ^d a cf>) = 2P o (0cW t crP_ 1 o<.<io0) + error. 
Since tp — PoP_io<.<io</>, it only remains to show the commutator estimate 

\\P (m) - RPqWWlIl* < C$ec 
where R is the matrix R := 4>d a <ft and \& is the function \& :— <9 Q P_io<.<io</>. 



From (14), (25) and summing over Littlewood-Paley pieces we have 



|V a 0|| t?r »-i<Cbe. 
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A similar argument using (16), (|25|) gives 

IIV^H^^n-l) < C £. 

Combining these together with we obtain 

l|ViZ|| £f£ »-i <C 2 s (39) 

while from (12), ( p5| ) we have 

\\^\\ L 2 L 2( n -l)/( n -3) <CoCQ. 

Thus to finish the proof of this Proposition it suffices to use the standard commu- 
tator estimate (with p = n — 1, q = 2(n — l)/(n — 3), and r = 2) 

Lemma 4.3. We have 

llfl)(/fl)-/ft(ff)llr<||V/||p|M|, (40) 

for all smooth functions f , g on R" and all 1 < p, q, r < oo such that 1/p + 1/q = 
1/r. 

For a previous application of this type of lemma to wave maps, sec [juj . 
Proof We begin with the identity 

W<?)(*) ~ fPo(g)(x) = / m(y)(/(x -y)- f(x))g(x - y) dy 



and use the Fundamental theorem of Calculus to rewrite this as 

/ fh(y)y ■ V/(x - ty)g(x ~ y) dy dt. 
o J 

Since rh(y)y is integrable, the claim then follows from Minkowski and Holder. 



We have thus transformed the non-linear equation (|l|) into the linear equation ( 31 ) 
(modulo a forcing term which can be dealt with). This procedure appears to be 
quite general, and does not rely on the specific form of (|l|). In principle, the linearity 
conveys several advantages, for instance we can now apply Duhamel's principle 
to remove the inhomogeneity, or the principle of superposition to decompose ip 
into wave packets or similar objects. However, we shall not exploit linearity in 
any significant way; for us, the main advantage of (|3l]) is that there is only one 
moderately high-frequency term in the non-linearity, with the other two factors 
being very smooth. Also, the linearity allows us to separate the role of the small 
quantities e and cq; ip will always be estimated with a bound of Co, whereas 4> will 
be estimated with a factor of e if it is high-frequency or is accompanied with a 
derivative, or by a constant otherwise. 

The main term in the right-hand side of (|3l]) cannot be placed in L]L 2 by Strichartz 
estimates; in fact the failure of the Strichartz estimates here is quite dramatic 
(more than a logarithm). Roughly speaking, this is because the derivatives in this 
term could fall on high-frequency components of 4> rather than low frequency ones. 
However, after the renormalization we will be able to move all the derivatives onto 
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low frequency terms, allowing Strichartz estimates to successfully place the non- 
linearity into L]L 2 . 



5. The tangent space of the sphere 



We now develop the analogue of (|5|). 
Proposition 5.1. We have the estimate 

\\ftd a ^\\ L 2 Ll < Cisco (41) 

Proof We first observe that ( f4l| ) is easy to show if the derivative is moved to the 
low frequency term: 

\\(d a 4>^\\ L 2 Ll <C 2 ec . (42) 

Indeed, we simply place ip in LfL 2 . using (18), ( p5j ) and d a 4> in L\U^ using (15), 
(p5|). From this and the product rule it thus suffices to show that 

\\d a (4>^)\\ L 2 Ll <C 2 sc . 

Consider the expression Po(^0). Heuristically, this quantity is approximately 
2(fti[>, by the same type of reasoning used to obtain the linearization (|J). On 
the other hand, since (j) lies on the sphere, Po(</>t</>) = -Po(l) = 0. Accordingly, we 
shall rewrite the above estimate as 

\\d a {P Q {^4>) - 2j>H)\\ LlLi < C 2 ec . (43) 

We now split (jy(j) as 

(P > _io^ t )(P > -io0) + ~4>HP- W< . <w 4>) + (P-io<.<io<^ 
plus other terms which vanish when Pq is applied. 

The first term of the above expression can be refined to 

E (P fel 0t)(P fc2 0) 

fci ,fc 2 > — 10:|A:i — fc 2 | <20 

since all the other components vanish after applying Pq. The contribution of this 
term to (O) can thus be estimated using the product rule and Holder by 



< E E \\d a PkML T Ll\\PkML^ + \\PkML^\\d a PkML ? Ll. 

fci,fc 2 >-10 fe 2 :|fci-fe 2 |<20 

Applying (15), (18), (p3), this can be bounded by 

< E 2 kl C c kl 2- nkl/2 C c kl 2- k ^ 2 

fci>-10 

which is acceptable by (^0|) and our choice of 5. 

The other two terms are equal to each other. It thus remains to show the commu- 
tator estimate 

\\d a (P a (4>^) - 0tp oW )|| i?L2 < c 2 ec 
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where W := f-io<.<io0- Since the expression inside the d a has Fourier support on 
l£l i5 1? we ma y discard the derivative d a . 



By Lemma 4.3 we may estimate the left-hand side of this by 

II V0iu» £ «. , 

and this is acceptable by the argument used to treat ([|2]) (as the commutator 
estimate has effectively moved the derivative from the high-frequency term to the 
low- frequency term; cf. the "I- method" trick in jnJ, jy], Q). ■ 

As a particular cor ollary of @, @, and Holder we have 

d a 4><ft d a ip = error 
and so from ([H]) we have the analogue 

Dip = 2A a d a tJj + error (44) 
of (^|), where A a is the anti-symmetric m x m matrix 

A a := d a (jffi - <f>d a ft. 

The equation ((44J) can be derived in an alternate way, which we now sketch. By 
differentiating (|l|) we have 

= -2(j)d a ^d a V x ,t4> ~ V<f> x , t d a tfd a 4>. 
Since <fi lies on the sphere, we have (jr^J x ,t<t> — 0, so we can rewrite this as 
□V x , t = -2A Q , oo a Q V x , t - ^ x .t<pd a ^d a <t> 

where 

If one then applies a Littlewood-Paley projection Pk to this equation then one can 
obtain (the derivative of) (fyj) by the same type of calculations as in the previous 
section. Note that the second term in the above equation is cubic in V ' x ,t4> an d 



so can be treated by Strichartz estimates since n > 3. This derivation of (44) 
works well in high dimensions but is difficult to use in low dimensions, especially 
n = 2, because one only has n/2 degrees of regularity, and so we cannot afford to 
differentiate the equation as above. 



6. Approximate parallel transport 



We now construct a matrix field U which is approximately orthogonal and which 
will renormalize ( |44| ) into a much better form, namely Dw — error. 

We shall use the scheme described in the introduction. More precisely, we let M be 
a large integer (depending on T!) to be chosen later and define the real m x m- valued 
matrix field U by 

U:=I+ Uk 

-M<fc<-10 
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where / is the identity matrix and the U k are defined inductively by 

U k := ((P k <j>)(P <k ^) - (P <k cf>)(P k ^))U <k (45) 

and 

U <k :=I+ Yl U «- 

-M< k '< k 

An easy inductive argument shows that U <k has Fourier support on the region 
{|£| < 2 fc+5 }, and thus that U has Fourier support on the region {|£| < 2~ 5 }. More 
generally, one expects U <k to have essentially the same estimates as P< k 4>, using 
the heuristic that the factors P< k <ft and U <k in ( [45| ) are bounded and therefore do 
not significantly affect the estimates. 



We now quantify the precise estimates on U which we shall need. 

Proposition 6.1. Assume that e is sufficiently small depending on Co, and M is 

sufficiently large depending on T, Co, e. Then we have the almost orthogonality 
property 

\\U*U - I\\ LrL ~,\\d t (UiU - <Cle. (46) 

In particular, if e is sufficiently small depending on Co , then U is invertible, and 

WuhrL-Alu-'hrL- <i- (47) 

Also, we have the approximate parallel transport property (cf. fffij) 

\\d a U - A a U\\ L i L oo <C 2 £ (48) 
as well as the additional bounds (needed to control error terms) 

\\d a U\\ L?L ~ < Cle (49) 

\\dJJ\\ LlL7 < Cle (50) 
\\^U\\l^ <C 2 e (51) 

for all a. 



The power of e is not sharp in many of these estimates, but that is irrelevant for 
our purposes. Interestingly, the above bounds on U do not seem to easily extend 
to any useful continuity estimates on the map <fi h- > U . In particular, small H n / 2 
perturbations in <fi can lead to large fluctuations in U in L°° . This phenomenon is 
the major obstacle to obtaining a critical Sobolev regularity well-posedness theory 
from our arguments, and is also a problem in obtaining scattering even for classical 
wave maps. 

Proof As noted in the introduction, we have the identity 

ulu<k + u{ k u k = 

whence 

u\ K u <K -i= u l Uk ( 52 ) 

-M< k <K 

for all —M <K<-9. 
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We now show inductively that 

\\U <k \\l T l~ < 2 (53) 

for all — M < K < —9. This is clearly true for K — —M. Now suppose that 
K > — M and the claim has been proven for all smaller K. Then from (^5|) and 
Holder we have 

\\Uk\\h^L^ < \\Pk(l>\\Lf>L%>\\P<k(l)\\L<*>L%> 

for all —M < k < K. By (17), (||), and (§§) we thus have 

and the induction ( |53| ) can thus be closed by (52) and (|l9|), if e is sufficiently small. 

From the above analysis we see that the first part of ( jiff ) obtains, as does the first 
part of (fl7|). In particular WU, and thus U, are invertible, and this gives the 
second part of (|47|). 



We now show (]49|), (£50|); the second part of (|6|) will then follow from (|47]), (|9|), 
and the product rule. 

We shall again use induction, showing that 

\\d a U <K \\ L? > LS > < C\2 K C^c K (54) 

and 

\\d a U <K \\ L 2 L ^ < C^^CIck (55) 
for all — M < K < — 9 and some sufficiently large absolute constant C\ ^> 1. 

The claim is trivial when K = —M. Now suppose that K > M and the claim has 
been proven for all smaller K. By differentiating ([jEJ) and using Holder, we obtain 

\\d a U K -i\\L^L^ <\\d a PK-i4>\\L^ l%>\\P<k-i4>\\l™ l^\\U<:k-i\\l^l^ 

+ \\PK-lHLf L^\\d a P<K-l(l>\\L^ L^\\U <K -i\\l^ 
+ \\PK-l^\\L T L~\\P<K-l<j>\\L T L~\\d a U <K -l\\L T L T 

and 

\\d a U K -x\\LlL%> %\\daPK-xHLlL^\\P<K-\())\\Lf'L^\\U < K-l\\L'»L^ 

+ \\PK-l$\\L?>L™\\d a P < K-l<f>\\L 2 t L^\\U < K-l\\Lf>L™ 
+ \\PK-l<l>\\L^Lf\\P<K~l<P\\L^L^\\d a U < K-l\\L 2 t L^- 

By applying (17), (15), (p5[), (|47|), and the induction hypothesis we thus see that 
\\d a U K -i\\L?Ls- £ 1 K Clc K {\ + de) 

and 

\\d a U K -i\\ L 2 L ~ < 2 K ' 2 Clc K {l + Che). 

If e is sufficiently small and C\ is sufficiently large depending on Co then one can 
close the induction hypothesis. This gives ([49|), (p0|). 

Next, we prove (^). We can write A a as A a .<-io where 

4,<fc := A«, <fe+ i := (d a P< k (p)P<k^ - [P<k<f>)(d a P<k<t>? ■ 
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We thus have the telescoping identity 

d a U-A a U = [ d a U k - (A a ,< k U< k - A a . <k U <k )} - A a .< 

-M <fc<-10 

To estimate A q <_m, we use (17), (pq) to obtain 



\A ai <-M\\L^L^ < 2 



-M 



This term is thus acceptable by Holder in time if M is sufficiently large depending 
on T, Co, e. 

By (|l^) and the triangle inequality it thus suffices to show that 

\\d a U k - (A at < k U< k — A a ^ <k U <k )\\ L i L ^ < CqcI 
for all —M < k < -10. 

We expand out 

d a U k = ((d a P k ^)(P <k ^) - {P <k <f>)(d a P k tf))U <k (56) 
+ ((P k <p)(d a P <k tf) - {d a P <k 4>)(P k ^))U <k 

+ ((p k ^){p <k ^) - (P <k <f>)(P k ^))d a u <k 

and 

A a ,< k U< k - A a , <k U <k = {(d a P k 4>){P <k( j>t) - {P <k 4>){d a P k 4> f ))U <k (57) 

+ {(d a P< k 4>){Pk^) ~ (P k <P)(d a P< k ^))U <k 

+ {{d a P< k 4>)(P< k $) - {P< k (£){d a P< k $))U k . 

In both expressions, the dangerous terms (56), (57) occur when the derivative falls 
on a high frequency term P k (f> instead of a low frequency term such as P< k (f), P< k 4>, 
U <k . (Indeed (57) is the only reason why A a fails to be in L\L^ , and is the only 
reason why we need a renormalization by U in the first place). Fortunately, we 
have chosen U so that the dangerous terms (56), (57) cancel each other. From the 
triangle inequality it thus suffices to show the bounds 

|||p^||9«p< fc 0||c/< fe ||| i i L oo,|||p fe 0||p< fe( /)||a Q [/ <fc ||| i i iS c, 

(58) 

ll|9aP<fc^||^||^< fc ||| L i L »j||a Q p< fc ^||p< fc ^||y fc ||| L i L „ <clc\. 



The first term in (pq) is acceptable by (15), (25) for the first two factors (dyadically 



decomposing the latter factor and using (20)) and (M7|) for the last. The second 
term is acceptable by (15), ( pq 1 ) for the first factor, ( p8| ) for the second, and fl55| ) 
for the last. The third term is treatable by the same argument as the first term. 



Finally, the fourth term is acceptable by (15), ( pq ) for the first term, (28) for the 
second term, and the estimate 

\\U k \\h\L~ <2- fe / 2 C c fe , 

which can be proven from ( fl5| ) and estimating P k (j> using (15), (p5|), P< k (f> using 
and U <k using @. 

The only remaining estimate to prove is (^l]). In principle this is the same type of 
estimate as (p9), but there is a minor complication arising from the double time 
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derivative in □, which are not directly treatable by the Sk norms. To get around 
this we will have to use the equation ([!]). 

More precisely, we shall need 
Lemma 6.2. For all k, we have 

< 2 fe ^-^C 3 c fc . (59) 



Proof Morally speaking this estimate obtains from (14), ( p5| ) if we treat time 
derivatives like spatial ones. We could have modified our Littlewood-Paley opera- 
tors to project in time as well as space in order to make this heuristic rigorous, but 
this creates other difficulties having to do with time localization which we wished 
to avoid. 

We shall show J59| ) for k = to simplify the exposition; the reader may verify that 
the argument below is scale invariant and thus extends to all k. 

Applying (|lj), we see it suffices to show that 

\\P ((j>d a tfd a <l>)\\ LlL 2-i <C$c . 

Let us first consider the contribution of 

0(9 Q P>50 t 9 Q P>io0. 

In this case it suffices by Bernstein's inequality (or Young's inequality) to obtain 
LjL^ estimates. From ( p5| ) and the definition of the Sk norm we have 

\\d a Pk<t>\\ Lt Li <C c k 2-^W\ 
From ( p0| ) and the assumptions on a we thus have 

\\daP>5<t>\\L*L%, \\9 a P>w4>\\LfL* ^ CoCo 

and this contribution is thus acceptable by (p8|). 

Now consider the contribution of 

(j)d a P< 5 <ft d a P >W (f>. 

In this case we can replace the first factor by P>5<^, since the error in doing so 
vanishes after applying Pq. We now modify the above argument, the only difference 
being that we now place the first term in L\L\ and the second in LfU^ . In fact, 
the summation is much better because the derivative is now on the low frequency 
term. 

A similar argument deals with 

(f)d a P >5 4> ] d a P< W (t> 

and so we are left with 

10 <P- 

We can split this into 

P- 5 < ■ < 1 5 0d Q P< 5 (9 a P< io 
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-P< - 5 <9„ P_ 5 < . < 5 T <9 Q P< i o </> , 

and 

P< - 5 4>9 a P< - 5 f d a P_ 5 < . < 1 <j> , 

plus some other terms which vanish when Pq is applied. 

For each of these three terms we place the high frequency (> —5) factor in L\ L™ -1 
and the other two factors in Regardless of the position of the derivatives, 

the high frequency factor has a norm of 0{CqCq) by (14), (p5|). Of the other two 
factors, both are bounded by 0(1) by (|2|), and at least one contains a derivative 
and therefore has a norm of 0(Cqe) by (17), (|25|). The claim then follows (if e is 
sufficiently small depending on Co). ■ 



We now prove (|5lj). As before, we shall use induction and in fact prove 
for all —M < K < -10 and some large C 2 > 1. 

The claim is trivial for K = —M. Now suppose that K > M and the claim has been 
proven for all smaller K. We apply □ to (^) and take absolute values (ignoring 
any possibility of cancellation) to obtain 

\UU K ^\ <\ P K-1<P\\P<K-1<P\PU <K _ 1 \ + \P K -l^\pP<K-im<K-l\ 

+ |nPif-i0||P<x-i^||C/<K-i| + |V B| tPjc_ 1 0||V B , t P< Jf _i^||l7 < x-i| 

+ |V K)t PK-i^|P<K-i0||V x , t f7<*:_i| + |Pjf-i^||V Xlt P<jr-x^||V ie , t l7<A-_i| 

We will show that all six terms on the right-hand side have an L^L™ -1 norm of 

<2 K( - 2 -i-^i) C *e(l + C 2 e) 1 
so that we can close the induction if e is sufficiently small and C 2 sufficiently large. 



For the first term we use (17), ( p5| ) for the first factor, (|2S]) for the second factor, 
and the induction hypothesis for the third factor. For the second and third terms 



we use Lemma 6.2 for the 0<f> term, and place the other two terms in LfU^ using 
(p8|), (47). For the remaining three terms we place the first factor in L^L" -1 using 
(15), (25) and the other two factors in LfL^ using (17), ( p5| ) for the second factor 
and <pm for the third. ■ 



7. Wrapping up 



Armed with Pro posi tion 3.1 we can now conclude the proof of (|26| ) and thus of 
Propositions 3J3 and ^2. Since U is invertible, we may write ip = Uw for some w, 
which is smooth by our assumptions. By (|47]), ( ^9|) and the Leibnitz rule in time 
we have 

HV'lk < IHk 

and so it suffices to show that 



HI So < C C . 



(60) 
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We expand (|4J) using the Leibnitz rule as 

UUw + 2d a Ud a w + {DU)w = 2A a Ud a w + 2A a {d a U)w + error. 

By (|47| ) we see that U~ 1 error = error, thus we can rewrite the previous as 

□to = -IV^idJJ - A a U)d a w + 2U- 1 A a {d a U)U- 1 ijj - C/ _1 (nJ7)[/" V + error. 

(61) 

We now show that all terms on the right-hand side are of the form error. 

To control the first term, it suffices by ([l7|), ( f48| ) (if e is sufficiently small depending 
on Co) to show that 

\\d a w\\ LrL 2 < Coc - 

Since tp — Uw and d a ip = Ud a w + (d a U)w, we have 

d a w = U- X d a i) + U-^d^p-^. 

The claim then follows from @, (18), (|2|) (for the first term) and @, @, (18), 
( p5| ) (for the second term). 

Now consider the second term in (|6l|). By (^8|), (]2^) we have 

\\A a \\ L 2 LT <C e 

and from (15), (18) we have 

H\\LrLi<c co. 

The claim then follows from ([It]) and (^p|). 

Now consider the third term in ([3l]). By (12), ( p5| ) we have 

ll^lll,fi2Cn-l)/(«-3) < Coco. 

The claim then follows from (|l]) and (ff7|). 

We have thus shown that Uw = error, or in other words that 

\\ aw \\L}Li <C$ec 
Also, from (|47]) and the assumption on ^[O] we have 

IH0]||w <cq. 



At this point one should be able to obtain (|60|) from Theorem \2.1\ , however w is not 
quite supported on the frequency annulus |£| ~ 1 (we have Fourier support control 
on U, but not on U - 1 ). However, we can apply P„io< <io to the above estimates 
and use Theorem 2.1 to conclude that 



||-P-iO<-<i(H|so < Coco 



(62) 



(providing that Co is sufficiently large depending only on n, m, a, and e is suffi- 
ciently small depending on Co). 



To pass from (62) to ( |60| ) we begin with the identity 

w = tfip - (U*U - I)w. 
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From the Fourier support of U and ip we see that U'i/> has Fourier support in the 
annulus 2~ 5 < |£| < 2 5 , which implies that 

(1 - P-io<-<io)w = -(1 - P-io<-<io)(U^U - I)w. 

Taking Sq norms of both sides we see that 

||(l-P_io<.<io)H|sr a £ \\(U^U - I)w\\ Sa . 
From ([46|), and Holder we thus obtain 

||w - P-io<-<iow||s < C 2 e 2 ||u;||so 



and (|60|) then follows from (J62J) if Co is sufficiently large and e is sufficiently small. 
This concludes the proof of Proposition [T^, and thus of Theorem |l.l| . 
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